NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Tracking live-cell single-molecule dynamics enables measurements of heterochromatin-associated protein–protein interactions

https://doi.org/10.1093/nar/gkae692

Chen, Ziyuan; Seman, Melissa; Fyodorova, Yekaterina; Farhat, Ali; Ames, Amanda; Levashkevich, Alexander; Biswas, Saikat; Huang, Fengting; Freddolino, Lydia; Biteen, Julie S.; et al (August 2024, Nucleic Acids Research)

Abstract Visualizing and measuring molecular-scale interactions in living cells represents a major challenge, but recent advances in single-molecule super-resolution microscopy are bringing us closer to achieving this goal. Single-molecule super-resolution microscopy enables high-resolution and sensitive imaging of the positions and movement of molecules in living cells. HP1 proteins are important regulators of gene expression because they selectively bind and recognize H3K9 methylated (H3K9me) histones to form heterochromatin-associated protein complexes that silence gene expression, but several important mechanistic details of this process remain unexplored. Here, we extended live-cell single-molecule tracking studies in fission yeast to determine how HP1 proteins interact with their binding partners in the nucleus. We measured how genetic perturbations that affect H3K9me alter the diffusive properties of HP1 proteins and their binding partners, and we inferred their most likely interaction sites. Our results demonstrate that H3K9 methylation spatially restricts HP1 proteins and their interactors, thereby promoting ternary complex formation on chromatin while simultaneously suppressing off-chromatin binding. As opposed to being an inert platform to direct HP1 binding, our studies propose a novel function for H3K9me in promoting ternary complex formation by enhancing the specificity and stimulating the assembly of HP1–protein complexes in living cells.
more » « less
DEMO-EM2: assembling protein complex structures from cryo-EM maps through intertwined chain and domain fitting

https://doi.org/10.1093/bib/bbae113

Zhang, Ziying; Cai, Yaxian; Zhang, Biao; Zheng, Wei; Freddolino, Lydia; Zhang, Guijun; Zhou, Xiaogen (March 2024, Briefings in Bioinformatics)

Abstract The breakthrough in cryo-electron microscopy (cryo-EM) technology has led to an increasing number of density maps of biological macromolecules. However, constructing accurate protein complex atomic structures from cryo-EM maps remains a challenge. In this study, we extend our previously developed DEMO-EM to present DEMO-EM2, an automated method for constructing protein complex models from cryo-EM maps through an iterative assembly procedure intertwining chain- and domain-level matching and fitting for predicted chain models. The method was carefully evaluated on 27 cryo-electron tomography (cryo-ET) maps and 16 single-particle EM maps, where DEMO-EM2 models achieved an average TM-score of 0.92, outperforming those of state-of-the-art methods. The results demonstrate an efficient method that enables the rapid and reliable solution of challenging cryo-EM structure modeling problems.
more » « less
Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data

https://doi.org/10.1038/s41592-023-02130-4

Zheng, Wei; Wuyun, Qiqige; Li, Yang; Zhang, Chengxin; Freddolino, Lydia; Zhang, Yang (January 2024, Nature Methods)

Abstract Leveraging iterative alignment search through genomic and metagenome sequence databases, we report the DeepMSA2 pipeline for uniform protein single- and multichain multiple-sequence alignment (MSA) construction. Large-scale benchmarks show that DeepMSA2 MSAs can remarkably increase the accuracy of protein tertiary and quaternary structure predictions compared with current state-of-the-art methods. An integrated pipeline with DeepMSA2 participated in the most recent CASP15 experiment and created complex structural models with considerably higher quality than the AlphaFold2-Multimer server (v.2.2.0). Detailed data analyses show that the major advantage of DeepMSA2 lies in its balanced alignment search and effective model selection, and in the power of integrating huge metagenomics databases. These results demonstrate a new avenue to improve deep learning protein structure prediction through advanced MSA construction and provide additional evidence that optimization of input information to deep learning-based structure prediction methods must be considered with as much care as the design of the predictor itself.
more » « less
BioLiP2: an updated structure database for biologically relevant ligand–protein interactions

https://doi.org/10.1093/nar/gkad630

Zhang, Chengxin; Zhang, Xi; Freddolino, Lydia; Zhang, Yang (July 2023, Nucleic Acids Research)

Abstract With the progress of structural biology, the Protein Data Bank (PDB) has witnessed rapid accumulation of experimentally solved protein structures. Since many structures are determined with purification and crystallization additives that are unrelated to a protein's in vivo function, it is nontrivial to identify the subset of protein–ligand interactions that are biologically relevant. We developed the BioLiP2 database (https://zhanggroup.org/BioLiP) to extract biologically relevant protein–ligand interactions from the PDB database. BioLiP2 assesses the functional relevance of the ligands by geometric rules and experimental literature validations. The ligand binding information is further enriched with other function annotations, including Enzyme Commission numbers, Gene Ontology terms, catalytic sites, and binding affinities collected from other databases and a manual literature survey. Compared to its predecessor BioLiP, BioLiP2 offers significantly greater coverage of nucleic acid-protein interactions, and interactions involving large complexes that are unavailable in PDB format. BioLiP2 also integrates cutting-edge structural alignment algorithms with state-of-the-art structure prediction techniques, which for the first time enables composite protein structure and sequence-based searching and significantly enhances the usefulness of the database in structure-based function annotations. With these new developments, BioLiP2 will continue to be an important and comprehensive database for docking, virtual screening, and structure-based protein function analyses.
more » « less
LOMETS3: integrating deep learning and profile alignment for advanced protein template recognition and function annotation

https://doi.org/10.1093/nar/gkac248

Zheng, Wei; Wuyun, Qiqige; Zhou, Xiaogen; Li, Yang; Freddolino, Lydia; Zhang, Yang (April 2022, Nucleic Acids Research)

Abstract Deep learning techniques have significantly advanced the field of protein structure prediction. LOMETS3 (https://zhanglab.ccmb.med.umich.edu/LOMETS/) is a new generation meta-server approach to template-based protein structure prediction and function annotation, which integrates newly developed deep learning threading methods. For the first time, we have extended LOMETS3 to handle multi-domain proteins and to construct full-length models with gradient-based optimizations. Starting from a FASTA-formatted sequence, LOMETS3 performs four steps of domain boundary prediction, domain-level template identification, full-length template/model assembly and structure-based function prediction. The output of LOMETS3 contains (i) top-ranked templates from LOMETS3 and its component threading programs, (ii) up to 5 full-length structure models constructed by L-BFGS (limited-memory Broyden–Fletcher–Goldfarb–Shanno algorithm) optimization, (iii) the 10 closest Protein Data Bank (PDB) structures to the target, (iv) structure-based functional predictions, (v) domain partition and assembly results, and (vi) the domain-level threading results, including items (i)–(iii) for each identified domain. LOMETS3 was tested in large-scale benchmarks and the blind CASP14 (14th Critical Assessment of Structure Prediction) experiment, where the overall template recognition and function prediction accuracy is significantly beyond its predecessors and other state-of-the-art threading approaches, especially for hard targets without homologous templates in the PDB. Based on the improved developments, LOMETS3 should help significantly advance the capability of broader biomedical community for template-based protein structure and function modelling.
more » « less
Integrating deep learning, threading alignments, and a multi‐MSA strategy for high‐quality protein monomer and complex structure prediction in CASP15

https://doi.org/10.1002/prot.26585

Zheng, Wei; Wuyun, Qiqige; Freddolino, Lydia; Zhang, Yang (August 2023, Proteins: Structure, Function, and Bioinformatics)

Abstract We report the results of the “UM‐TBM” and “Zheng” groups in CASP15 for protein monomer and complex structure prediction. These prediction sets were obtained using the D‐I‐TASSER and DMFold‐Multimer algorithms, respectively. For monomer structure prediction, D‐I‐TASSER introduced four new features during CASP15: (i) a multiple sequence alignment (MSA) generation protocol that combines multi‐source MSA searching and a structural modeling‐based MSA ranker; (ii) attention‐network based spatial restraints; (iii) a multi‐domain module containing domain partition and arrangement for domain‐level templates and spatial restraints; (iv) an optimized I‐TASSER‐based folding simulation system for full‐length model creation guided by a combination of deep learning restraints, threading alignments, and knowledge‐based potentials. For 47 free modeling targets in CASP15, the final models predicted by D‐I‐TASSER showed average TM‐score 19% higher than the standard AlphaFold2 program. We thus showed that traditional Monte Carlo‐based folding simulations, when appropriately coupled with deep learning algorithms, can generate models with improved accuracy over end‐to‐end deep learning methods alone. For protein complex structure prediction, DMFold‐Multimer generated models by integrating a new MSA generation algorithm (DeepMSA2) with the end‐to‐end modeling module from AlphaFold2‐Multimer. For the 38 complex targets, DMFold‐Multimer generated models with an average TM‐score of 0.83 and Interface Contact Score of 0.60, both significantly higher than those of competing complex prediction tools. Our analyses on complexes highlighted the critical role played by MSA generating, ranking, and pairing in protein complex structure prediction. We also discuss future room for improvement in the areas of viral protein modeling and complex model ranking.
more » « less
Protein target highlights in CASP15 : Analysis of models by structure providers

https://doi.org/10.1002/prot.26545

Alexander, Leila_T; Durairaj, Janani; Kryshtafovych, Andriy; Abriata, Luciano_A; Bayo, Yusupha; Bhabha, Gira; Breyton, Cécile; Caulton, Simon_G; Chen, James; Degroux, Séraphine; et al (July 2023, Proteins: Structure, Function, and Bioinformatics)

Abstract We present an in‐depth analysis of selected CASP15 targets, focusing on their biological and functional significance. The authors of the structures identify and discuss key protein features and evaluate how effectively these aspects were captured in the submitted predictions. While the overall ability to predict three‐dimensional protein structures continues to impress, reproducing uncommon features not previously observed in experimental structures is still a challenge. Furthermore, instances with conformational flexibility and large multimeric complexes highlight the need for novel scoring strategies to better emphasize biologically relevant structural regions. Looking ahead, closer integration of computational and experimental techniques will play a key role in determining the next challenges to be unraveled in the field of structural molecular biology.
more » « less

Search for: All records